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NOVEL POLYMORPHIC MICROSATELLITE MARKERS 
IN THE HUMAN MHC CLASS H REGION 

FIELD OF THE INVENTION 
The present invention relates to novel polymorphic microsatellite markers in the 
human MHC class II region and methods for disease mapping and genotyping with said 
microsatellite markers. 

BACKGROUND OF THE INVENTION 
The human major histocompatibility complex (MHC) is positioned on the short 
arm of the 6th chromosome, band p21.3, and has been divided into three non-overlapping 
segments called class I, H, and IH (Campbell, D. and Trowsdale, J. (1997) Immunol. 
Today 18, 43; The MHC sequencing consortium (1999) Nature 401, 921-923). The HLA 
class n region can be largely subdivided into four subregions; DP, DO, DQ, and DR. 
The HLA class II genes display an extensive degree of genetic polymorphism and encode 
cell surface molecules that are involved in the presentation of exogenous antigens to the 
immune system (Kappes, D. and Strominger, J. L. (1988) Annu. Rev. Biochem. 57, 991- 
1028). Allelic variants of these class II genes are associated with a large number of 
diseases, e.g., rheumatoid arthritis (Shibue, T. et al. (2000) Arthritis and Rheumatism 43, 
753-757), insulin-dependent diabetes mellitus (IDDM) (Sanjeevi, C. B. (2000) Human 
Immunology 61, 148-153; Todd, J. A. et al. (1987) Nature 329, 599-604; She, J.-X. 

(1996) Immunol. Today 17, 323-329), IgA deficiency (Olerup, O. et al. (1990) Nature 
347, 289-290; Olerup, O. et al. (1992) Proc. Natl. Acad. Sci. USA 89, 10653-10657; Reil, 
A. et al. (1997) Tissue Antigens 50, 501-506), multiple sclerosis (Haegert, D. G. et al. 
(1989) J. Neurosci. Res. 23, 46-54; Haegert, D. G. and Francis, G. S. (1992) Hum. 
Immunol. 34, 85-90; Allen, M. et al. (1994) Hum. Immunol. 39, 41-48), idiopathic 
nephrotic syndrome (Konrad, M. et al. (1994) Tissue Antigens 43, 275-280), pemphigus 
vulgaris (Delgado, J. C. et al. (1996) Tissue Antigens 48, 668-672; Delgado, J. C. et al. 

(1997) Hum. hnmunol. 57, 110-119), and idiopathic nonobstructive azoospermia 
(Tsujimura, A. et al. (1999) J. Androl. 20, 545-550). 
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To date, at least 197 HLA-DRBl, 19-DQAl, 35-DQBl, 13-DPAl, and 83-DPBl 
alleles have been officially recognized (Marsh, S. G. E. (1998) Tissue Antigens 51, 467- 
507). The diversity of the HLA haplotypes in human populations has served as a useful 
landmark to roughly map disease-susceptibility loci in this region (Trowsdale, J. (1996) 
5 Molecular genetics of HLA class I and class 11 regions. In: Browning, M. J. and 
Mcmichael, A. J. eds., HLA and MHC genes, molecules and function. Oxford: Bios 
Scientific Publishers Ltd., 23-39; Hall, F. C. and Bowness, P. (1996) HLA and diseases: 
from molecular function to disease association? hi: Browmng, M. J. and Mcmichael, A. 
J. eds., HLA,, and MHC genes, molecules and function. Oxford: Bios Scientific 
10 Publishers Ltd. 353-381). However, only five genes, encoding the polymorphic HLA 
antigens (HLA-DRBl, -DQAl, -DQBl, -DPAl, and -DPBl), have so far been available 
as genetic markers in this region. Given this and the fact that the region spans over 
approximately 1.1 Mb and contains more than 30 functional genes (Forbes, S. A. and 
Trowsdale, J. (1999) Immunogenetics 50, 152-159; Beck, S. and Trowsdale, J. (1999) 
1 5 Immunological Reviews 167, 201-210), it is therefore difficult, if not unpossible, to 
precisely pinpoint most of disease susceptibility loci to their respective single genetic 
entities using only the available HLA class II diversity. 

This is mainly because of the tight linkage disequilibrium observed throughout 
the class 11 region. For example, IDDM was first reported to be associated with DR3 and 
20 DR4 in Caucasoids (Rotter, J. I. et al. (1983) Diabetes 32, 169-174). Since then, many 
studies using world-wide populations have shown associations not only with DRBl, but 
also with DQAl, DQBl, and DPBl alleles (Thompson, G. et al. (1988) Am. J. Hum. 
Genet. 43, 799; Renningen, K. S. et al. (1992) HLA class H associations m insulin 
dependent diabetes mellitus among Blacks, Caucasoids and Japanese, hi: Tsuji, K. et al. 
25 eds., HLA 1991, vol.1. Oxford: Oxford University Press; Caillat-Zucman, S. et al. (1997) 
Insulin dependent diabetes mellitus (IDDM): 12th hitemational Histocompatibility 
Workshop study. In: Charron, D. ed., HLA, vol. 1. France: EDK). The highest risk for 
developing the disease has been associated with the heterozygous DR3/DR4 phenotype, 
particularly in combination with DQA1*0501-DQB1*0201/DQA1*0301-DQB1*0302 
30 alleles in Caucasian populations (Owerbach, D. et al. (1983) Nature 303, 815-817; 

Amheim. N et al. (1983) Proc. Natl. Acad. Sci. USA 82, 6970-6974; Cohe-Haguenauer, 
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O. et al. (1985) Proc. Natl. Acad. Sci. USA 82, 3335-3339; Bohme, J. et al. (1986) J. 
Immunol. 137, 941-947; Festenstein, H. et al. (1986) Nature 322, 64-67; Nepom, B. S. et 
al. (1986) J. Exp. Med. 164, 345-350; Schreuder, G. M. et al. (1986) J. Exp. Med. 164, 
938-943; Tait, B. D. and Boyle, A. J. (1986) Tissue Antigens 28, 65-71), suggesting that 

5 HLA-DQ rather than -DR is involved in genetic predisposition to IDDM (57 DQB 1 non- 
Asp theory) (Todd, J. A. et al. (1987) Nature 329, 599-604). However, transracial studies 
have revealed that the susceptible molecules and the degree of their respective 
contribution appear to be different in various populations. In Chinese and Japanese, for 
instance, the DR3/DR4 heterozygous allele and the DR4 homozygous allele, respectively, 

10 are strong susceptible genotypes (Hu, C. Y. et al. (1993) Hum. hnmunol. 38, 105-114; 
Huang, H. S. et al. (1988) J. Formosan Med. Assoc. 87, 1-6; Huang, H. S. et al. (1992) J. 
Formosan Med. Assoc. 91, 233-236; Ju, L. Y. et al. (1991) Tissue Antigens 37, 218-223). 
Thus, it has been difficult to determine which one, DR or DQ locus, is the true 
pathogenic gene for IDDM. 

1 5 Microsatellites are tandemly repeated sequences of 2-6 bps which are widely 

dispersed throughout the human genome (Amos, W. and Rubinsztein, D. C. (1996) 
Nature genetics 12, 13-14; Edwards, A. I. et al. (1991) Am. J. Hum. Genet. 49, 746-756). 
They have been extensively used for linkage mapping as well as forensic and population 
studies (Bowcock, A. M. et al. (1994) Nature 368, 455-457; Brinkmann, B. et al. (1996) 

20 Hum. Genet. 98, 60-64). Polymorphism observed at these loci is due simply to variation 
in the number of repeats of a single unit; the so-called stepwise model. Valdes, A. M. et 
al. (1993) Genetics 133, 737-749; Levinson, G. and Gutman, G. A. (1987) Mol. Biol. 
Evol. 4, 203-221). 

Previously, 38 polymorphic microsatellite repeats in the HLA class I region were 
25 collected (Tamiya, G. et al. (1998) Tissue Antigens 51, 337-346; Tamiya, G. et al. (1999) 
Tissue Antigens 54, 221-228). These microsatellites were subsequently used for 
association mapping of HLA class I associated diseases leading, among others, to a 
successful narrowing of critical regions for Beh9et's disease and psoriasis vulgaris to 
approximately 50 kb between the MICA and HLA-B genes, and between the P0U5FI 
30 and S genes, respectively (Ota, M. et al. (1999) Am. J. Hum. Genet. 64, 1406-1410; Oka, 
A. et al. (1999) Hum. Mol. Genet. 88, 2165-2170). 
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In the HLA class II region, however, no polymorphic microsatellite repeat has 
been identified yet. Therefore, it is still unavailable to map susceptibility genes for 
diseases associated with HLA class II alleles. 

5 SUMMARY OF THE INVENTION 

An objective of the present invention is to provide novel polymorphic 
microsatellite markers in the human MHC class II region and methods for disease 
mapping and genotyping with said microsatellite markers. These novel polymorphic 
microsatellites will provide useful genetic markers in HLA-related research, such as 

1 0 genetic mapping of HLA class II associated diseases, transplantation matching, 

population genetics, and identification of recombination hot spots as well as linkage 
disequilibrium studies. 

The present inventors have analyzed 2-'5 base short tandem repeats 
(microsatellites) in the genomic sequence of the HLA class II region (1 . 1 Mb) by the 

1 5 computer program of Abajian (http://vww.abajian.com/sputnik/) to identify a total 
number of 494 microsatellites from the genomic sequence. From among them, the 
present inventors selected microsatellites with more than 10 repeats for di-nucleotide 
repeats and with more than 5 repeats for tri-, tetra-, and penta-nucleotide repeats to obtain 
145 microsatellites. 

20 The present inventor, then, randomly chosen 41 out of the 145 microsatellite 

repeats mentioned above and predicted, by a rough survey using 8 Japanese HLA 
homozygous B-cell Unes, that 31 out of these 41 microsatellite repeats should be quite 
polymorphic. Furthermore, the present inventors investigated allele frequencies and 
heterozygosities of 21 out of these 31 microsatellite repeats using 190 unrelated Japanese 

25 individuals by the polymerase chain reaction (PGR) combined with fluorescent-based 
automated fragment technology. Finally, the present inventors obtained 21 novel 
polymorphic microsatellite markers with high number of alleles and high polymorphism 
information content (PIC) to accomplish the present invention. 

Namely, the present invention relates to novel polymorphic microsatellite markers 

30 in the MHC HLA class 11 region and methods for disease mapping and genotyping with 
said microsatellite markers. More specifically, the present invention relates to: 
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1) An oligonucleotide primer, wherein said primer is capable of specifically 
hybridizing to a DNA having the sequence of the flanking regions of a microsatellite 
selected from the group consisting of M2_4_9, M2_2_9, M2_2_12, M2_3_ll, M2_2_20, 

5 M2_2_2 1 , M2_2_22, M2_2_23, M2_2_24, M2_4_25, M2_4_26, M2_2_29, M2_2_32, 
M2_4_32, M2_4_33, M2_4_37, M2_3_22, M2_2_36, M2_5_l 1, M2_2_46, and 
M2_2_48. 

2) The oligonucleotide primer according to 1), wherein the sequence of said primer 
is selected from the group consisting of SEQ ID NOs: 1-42. 

10 3) A kit for determining the number of repeat units of a microsatellite selected from 
the group consisting of M2_4_9, M2_2_9, M2_2_12, M2_3_ll, M2_2_20, M2_2_21, 
M2_2_22, M2_2_23, M2_2_24, M2_4_25, M2_4_26, M2_2_29, M2_2_32, M2_4_32, 
M2_4_33, M2_4_37, M2_3_22, M2_2_36, M2_5_l 1, M2_2_46, and M2_2_48, the kit 
comprising a pair of oligonucleotide primers having the sequence of the flanking regions 

15 of said microsatellite. 

4) The kit according to 3), comprising a pair of oligonucleotide primers selected 
from the group consisting of 

(a) SEQ ID NO: 1 and SEQ ID NO: 2, 

(b) SEQ ID NO: 3 and SEQ ID NO: 4, 
20 (c) SEQ ID NO: 5 and SEQ ID NO: 6, 

(d) SEQ ID NO: 7 and SEQ ID NO: 8, 

(e) SEQ ID NO: 9 and SEQ ID NO: 10, 

(f) SEQ ID NO: 1 1 and SEQ ID NO: 12, 

(g) SEQ ID NO: 13 and SEQ ID NO: 14, 
25 (h) SEQ ID NO: 1 5 and SEQ ID NO: 1 6, 

(i) SEQ ID NO: 17 and SEQ ID NO: 18, 
G) SEQ ID NO: 19 and SEQ ID NO: 20, 
(k) SEQ ID NO: 21 and SEQ ID NO: 22, 
(1) SEQ ID NO: 23 and SEQ ID NO: 24, 
30 (m) SEQ ID NO: 25 and SEQ ID NO: 26, 
(n) SEQ ID NO: 27 and SEQ ID NO: 28, 
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(o) SEQ ID NO: 29 and SEQ ID NO: 30, 
(p) SEQ ID NO: 31 and SEQ ID NO: 32, 
(q) SEQ ID NO: 33 and SEQ ID NO: 34, 
(r) SEQ ID NO: 35 and SEQ ID NO: 36, 
5 (s) SEQ ID NO: 37 and SEQ ID NO: 3 8, 
(t) SEQ ID NO: 39 and SEQ ID NO: 40, and 
(u) SEQ ID NO: 41 and SEQ ID NO: 42. 

5) A method for determining the number of repeat units of a microsatellite, the 
method comprising a step for determining the number of repeat units in the region of 

1 0 which DNA can be amplified by using the kit according to 4). 

6) A method for mapping of susceptibility genes for disease associated with HLA 
class n alleles, by using a microsatellite marker selected from the group consisting of 
M2_4_9, M2_2_9, M2_2_12, M2_3_l 1, M2_2_20, M2_2_21, M2_2_22, M2_2_23, 
M2_2_24, M2_4_25, M2_4_26, M2_2_29, M2_2_32, M2_4_32, M2_4_33, M2_4_37, 

1 5 M2_3_22, M2_2_36, M2_5_l 1 , M2_2_46, and M2_2_48, the method comprising: 

(a) determining the number of repeat units of said microsatellite, 

(b) estimating the allele frequencies of patients and controls, based on said number, and 

(c) comparing the allele frequencies of patients with those of controls. 

7) The method according to 6), the method comprising: 

20 (a) amplifymg a region of microsatellite using the oligonucleotide primer according to 1 ) 
or 2), 

(b) determining the number of repeat units of said microsatellite, 

(c) estimating the allele frequencies of patients and controls, based on the number, and 

(d) comparing the allele frequencies of patients with those of controls. 

25 8) A method for genotyping of a microsatellite allele selected from the group 

consisting of M2_4_9, M2_2_9, M2_2_12, M2_3_l 1, M2_2_20, M2_2_21, M2_2_22, 
M2_2_23, M2_2_24, M2_4_25, M2_4_26, M2_2_29, M2_2_32, M2_4_32, M2_4_33, 
M2_4_37, M2_3_22, M2_2_36, M2_5_l 1, M2_2_46, and M2_2_48, the method 
comprising: 

30 (a) amplifying a region of the microsatellite, and 

(b) determining the number of repeat units of said microsatellite. 
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9) The method according to 1\ wherein said amplifying is performed by using the 
oligonucleotide primer according to 1) or 2). 



BRIEF DESCRIPTION OF THE DRAWINGS 
5 Figure 1 shows the gene map and location of microsatellite repeats in the HLA 

class II region. The top line indicates the scale of the entire HLA region and the map 
position of representative HLA antigen genes, HLA-DP, -DQ, -DR, -B, -C, -E, -J, -A, -G, 
and -F. Rectangles on the second line indicate the already known genes in the HLA class 
II region. Arrows show the transcriptional orientation of these genes. The bottom line 
1 0 indicates the location of the polymorphic microsatellite markers of the present invention. 

DETAILED DESCRIPTION OF THE INVENTION 
In the present invention, the term "microsatellite" means a 2-5 base short tandem 
repeat in a polynucleotide sequence. The microsatellites are classified into the following 
1 5 three kinds of repeats: perfect repeat, imperfect repeat, and compound repeat. A perfect 
repeat is defined as a tandem repeat without interruption and without adjacent repeats of 
another sequence. An imperfect repeat is defined as two or more runs of uninterrupted 
repeats separated by nonrepeat bases. A compound repeat corresponds to those 
containing stretches of two or more different repeats. Preferably, the microsatellite of the 
20 present invention is a perfect or imperfect repeat. 

The microsatellite of the present invention is named as "M2_n_m", where "M2" 
represents the serial number of temporary consensus genomic sequence, "n" indicates the 
number of nucleotides in repetition units {l-5\ and "m" represents a serial number. 

Microsatellite loci that are useful in the present invention will have the general 
25 formula: 

L(M)nR 

where L and R are non-repetitive flanking sequences that uniquely identify the particular 
30 locus, M is a repeat motif, and n is the number of repeats. The locus may be present 
inside or outside coding region of genes on a human chromosome. 
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The flanking sequences L and R uniquely identify the microsatellite locus within 
the human genome. L and R will be at least about 1 8 nucleotides in length, and may 
extend a distance of several thousand bases. The DNA having L and R sequences may be 
obtained in substantial purity as restriction fragments, amplification products, etc., and 
will be obtained as a sequence other than a sequence of an intact chromosome. Usually, 
the DNA will be obtained substantially free of other nucleic acid compounds which do 
not include a microsatellite sequence or fragment thereof, generally being at least about 
50%, usually at least about 90% pure and are typically "recombinant", i.e., flanked by 
one or more nucleotides with which they are not normally associated on a natural 
chromosome. 

Within the flanking sequences L and R, sequences will be selected for 
amplification primers Pl and Pr. The exact compositions of the primer sequences are not 
critical to the invention, but Pl and Pr must hybridize to the flanking sequences L and R 
respectively, or complementary strand thereof, under stringent conditions. Conditions for 
stringent hybridization are known in the art, for example, one may use a solution of 
5xSSC and 50% formamide, incubated at 42°C, preferably 50°C or 65°C. To maxunize 
the resolution of size differences at the locus, it is preferable to chose a primer sequence 
that is close to the repeat sequence, usually within at least about 100 nucleotides of the 
repeat, more usually at least about 50 nucleotides, and preferably at least about 
25 nucleotides. Algorithms for the selection of primer sequences are generally known, 
and are available in commercial software packages. The primers will hybridize to 
complementary strands of chromosomal DNA, and will prime towards the repeat 
sequences, so that the repeats will be amplified. The primers will usually be at least 
about 18 nucleotides in length, and usually not more than about 35 nucleotides in length. 
Primers may be chemically synthesized in accordance with conventional methods or 
isolated as fragments by restriction enzyme digestion, etc. 

The term "polymorphic" means that, n, Hie number of repeat motif M at a specific 
locus, is variable in a population. Therefore, the polymorphisms of a microsatellite are 
represented as the differences in the length of DNA that Ues between the flanking 
sequences L and R. The differences can be detected by amplifymg a region of the 
microsatellite using suitable primers, size fractionating the amplified products by a 
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denaturing polyacrylamide gel electrophoresis, and comparing the size of the amplified 
products. It is expected that microsatellites with more than 10 repeats for di-nucleotide 
repeats and with more than 5 repeats for tri-, tetra-, and penta-nucleotide repeats display a 
high degree of repeat polymorphism (Weber, W. J. L. (1990) Genomics 7, 524-530). 
5 When the observed frequencies of i th and j th alleles at a given microsatellite 

locus are represented as pi and pj, heterozygosity (Ht) is calculated with: 

Ht=l-Zpi^ 

10 and polymorphism information content (PIC) value is calculated with: 

PIC = Ht-SS2piV- 

Higher Ht and PIC values in the population reflect the higher degree of variability 
] 5 within the locus. In the present invention, tiie Ht value is preferably at least 0.5 and is 
more preferably at least 0.7, and PIC value is preferably at least 0.25 and is more 
preferably at least 0.5. 

The present invention relates to oligonucleotide primers capable of specifically 
hybridizmg to the flanking regions of the following 21 microsatellites: 



M2_4_9 


M2_2_23 


M2_4_33 


M2_2_9 


M2_2_24 


M2_4_37 


M2_2_12 


M2_4_25 


M2_3_22 


M2_3_ll 


M2_4_26 


M2_2_36 


M2_2_20 


M2_2_29 


M2_5_ll 


M2_2_21 


M2_2_32 


M2_2_46, and 


M2_2_22 


M2_4_32 


M2_2_48 



"Specifically hybridizing" means that there is no significant cross-hybridization to 
30 imrelated regions of the genome under an ordinary hybridization conditions, and 

preferably under a stringent hybridization conditions. The microsatellites of the present 
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invention comprises the respective repeat units indicated in Table 3. "Flanking regions of 
a microsatellite" are regions located upstream and downstream of each repeat unit, which 
is in between the two regions. Namely, the above-mentioned microsatellite is defined as 
a region comprising the repeat unit shown in Table 3 and existing in the genome in 

5 between the flanking regions indicated in Table 3. It is to be noted that, in Table 3, the 
antisense-strand nucleotide sequences of the flanking regions are indicated as 5-3' 
direction. The oligonucleotide primer of the present invention comprises a nucleotide 
sequence complementary to the sequence of either of the flanking region, or the 
complementary strand thereof; and preferably the primer has 18 nucleotide residues or 

10 more. The term "complementary strand" here indicates opposite strand to one strand of a 
DNA duplex consisting of A/T (U in the case of RNA) and G/C base pairs. 
"Complementary" means not merely being fully complementary in the region with at 
least 18 consecutive nucleotides but also being homologous in at least 70% of the 
nucleotides, preferably in at least 80% of the nucleotides, more preferably in at least 

1 5 90%, yet more preferably in 95% or more of the nucleotides. Nucleotide sequence 
homology is determined by using a publicly known algorithm such as BLASTIN. 
Preferable nucleotide sequences of the oligonucleotide primers of the present invention 
are shown in SEQ ID NOs: 1-42. The relation between each SEQ ID NO and the 
microsatellite sequence is indicated in Table 3. Each nucleotide sequence of SEQ ID 

20 NOs: 1-42 is just an example; and the oligonucleotide primers of the present invention 
shoxxld be construed as not to be Umited to the nucleotide sequences illustrated. 
Therefore, the oUgonucleotide primers of the present invention include any 
oligonucleotide primers capable of amplifying regions containing the full-length repeat 
units amplified by using the oligonucleotide primers indicated in Table 3. The number of 

25 repeat units consisting microsatellites can be determined by amplifying the repeat units 
with the oligonucleotide primers of the present invention. 

Any suitable amplification procedure known to one skilled in the art, such as, but 
not limited to, polymerase chain reaction (PGR), Q|3 replication, isothermal sequence 
replication, or ligase chain reaction may be used. However, the most developed and well 

30 understood amplification systems are PGR systems. Thus, PGR is currently the preferred 
method of ampHfication. Suitable reaction conditions for PGR are described in Saiki et 
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al (1985) Science 239, 487, and Sambrook, et al. (1989) Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, 14.2-14.33. 

Conveniently, a detectable label will be included in the amplification reaction. 
Suitable labels include fluorochromes, e.g., fluorescein isothiocyanate (FITC), 

5 rhodamine, Texas Red, phycoerythrin, allophycocyanin, 6-carboxyfluorescein (6-FAM), 
2\T -dimethoxy-4 ' ,5 ' -dichloro-6-carboxyfluorescrin (JOE), 6-carboxy-X-rhodamine 
(ROX), 6-carboxy-2\4',7',4,7-hexachlorofluorescein (HEX), 5-carboxyfluorescein 
(5-FAM), N,N,N',N'-tetramethyl-6-carboxyrhodamine (TAMRA), 6-carboxy-4,7,2',7'- 
tetrachloro-fluorescein (TET), or New Electrophoresis Dye (NED); radioactive labels, 

10 e.g., 32P, 35S, 3H; etc. The label may be a two-stage system, where the amplified DNA 
is conjugated to biotin, haptens, etc., having a high affinity binding partner, e.g., avidin, 
specific antibodies, etc., where the binding partner is conjugated to a detectable label. 
The label may be conjugated to one or both of the primers. Alternatively, the pool of 
nucleotides used in the amplification may be labeled, so as to incorporate the label into 

1 5 the amplification product. 

Detection and size determination of amplification products such as PCR products 
fi:om a specific microsatellite locus can be accomplished by several means. In one 
embodiment, amplification products are labeled with ^^P, size firactionated by a 
denaturing polyacrylamide gel electrophoresis, and visualized by autoradiography. In 

20 another embodiment, the amplification products are labeled with a fluorochrome, and 
separated on an automated DNA sequencing apparatus. The automated sequencer is 
particularly usefiil with multiplex amplification. Another method separates the 
amplification products by capillary electrophoresis, which has the advantage of being 
much faster than acrylamide gel electrophoresis while maintaining the accuracy of sizing. 

25 A review of capillary electrophoresis may be found in Landers et al. (1 993) 
BioTechniques 14, 98-1 1 1 . 

Simultaneous analysis can be performed for multiple, different type 
microsatellites in the present invention. To achieve this, respective primer sets for 
amplifying multiple microsatellites are pre-labeled with different labels. The resulting 

30 amplification products obtained are fractionated, for example, by capillary 
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electrophoresis, and lengths of the fragments are determined for each label, thereby 
achieving the simultaneous analysis of multiple, different type microsatellites. 

The size of the amplification product is proportional to n, the number of repeats 
that are present at the locus specified by the primers. The size will be polymorphic in the 
5 population, and is therefore an allelic marker for that locus. 

A kit may be provided for practice of the present invention. Such a kit will 
contain at least one set of oligonucleotide primers of present invention, useful for 
amplifying microsatellite DNA repeats. The primers may be conjugated to a detectable 
label. 

10 The present invention also relates to a method for genotyping comprising the 

following steps (a) and (b): 

(a) amplifying a region of the microsatellite, and 

(b) determining the number of repeat units of said microsatellite. 

15 

The present invention further relates to a method for mapping of susceptibility 
genes for disease associated with HLA class II alleles, comprising the following step (c): 

(c) estimating the allele frequencies of patients and controls, based on the number, 
20 and the method of the present invention preferably comprises the following step (d): 

(d) comparing the allele frequencies of patients with those of controls. 

DNA corresponding to a region of the microsatellite is amplified from a human 
genomic DNA sample by a publicly known method. The above-mentioned microsatellite 

25 can be amplified with the oligonucleotide primers of the present invention by using total 
genomic DNA or purified DNA containing the HLA class II region as a template. The 
number of repeat units in the amplification products is determined according to the 
method as described above. The number of repeat units of a microsatellite represents a 
genotype of the individual from which the genomic DNA has been derived. 

30 Analysis for establishing a link between the thus determined genotype and a 

specific phenotype is called "linkage analysis." By elucidating the association with 
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susceptibility genes for a disease, it is possible to clarify where the gene is located in the 
HLA class II region, in particular. Identifying the genomic location of a particular gene 
is called "mapping." The mapping is performed by accumulating information on the 
frequency of each genotype in a population with a hereditary character whose association 

5 with the gene is to be analyzed and by revealing the relationship between the hereditary 
character and the genotype. 

Each genotype frequency in a population is herein designated as "allele 
frequency," In general, when the frequency of a genotype is significantly high in a 
population with a particular hereditary character, it can be assumed that the microsatellite 

10 corresponding to a genotype is located in the genome in the vicinity of the causative gene 
for the phenotype. The analysis can be carried out, for example, as follows: first, a 
particular disease is specified for testing, and then microsatellite analysis is carried out to 
identify the genotype. The same analysis is performed in a group of normal healthy 
persons. Frequency of the genotype is compared between the two groups. When there is 

15 a significant difference in the genotype frequency between the two, then the disease is 
assumed to be associated with the number of repeat units of a microsatellite representing 
the genotype. Further, mode of inheritance of the susceptibility genes for the disease can 
be estimated by retrospective pedigree analysis for the association of the disease with the 
genotype. 

20 The above-mentioned microsatellites are located in the HLA class II region. 

Genes playing important roles in the immunological system have been mapped in the 
HLA class II region. Many disease-associated genes previously reported have also been 
mapped in this region. Thus, microsatellites located in the HLA class II region and 
giving enough PIC are useful markers for linkage analysis for a variety of hereditary 

25 characters. By using the 21 microsatellites disclosed in the present invention, the HLA 
class n region with an overall length of about 1.1. Mb is examined at average resolution 
of 52 Kb by linkage analysis. The present invention is greatly significant providing 
microsatellites enabling such high-resolution analysis of the HLA class II region, where 
important information is believed to be contained densely, 

30 The present invention is illustrated in more detail with reference to following 

EXAMPLES, but is not to be construed as being Umited thereto. 
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EXAMPLE 1 Detection of microsatellite repeats in the HLA class II region 

The entire sequence of the HLA class II region from the HSET to TSBP genes 
(Figure 1) (The MHC sequencing consortium (1999) Nature 401, 921-923) was retrieved 

5 from the database (http://ww.sanger.ac.xik/HGP/Chr6/MHC.shtml). To detect 

microsatellites with di- to penta-nucleotide repeats in this approximately 1.1 Mb region, 
the genomic sequence was subjected to microsatellite detection analysis by the computer 
program Sputnik (Abajian, http://www.abajian.com/sputnik/). Of the detected 
microsatellites, those di-nucleotide repeats carrying more than 10 repeat units and those 

1 0 tri-, tetra-, and penta-nucleotide repeats with over 5 repeat xmits defined the final 

selection as these were expected to display a high degree of polymorphism (Weber, W. J. 
L. (1990) Genomics 7, 524-530). 

EXAMPLE 2 Identification of microsatellite repeats in the HLA class II region 
1 5 Microsatellite repeats identified in the HLA class II region (1 . 1 Mb from the 

HSET to TSBP genes. Figure 1) (The MHC sequencing consortium (1999) Nature 401, 
921-923) amounted to 494 in total, consisting of 158 di-, 65 tri-, 163 tetra-, and 108 
penta-nucleotide repeats (Table 1). Four tri-nucleotide repeats are localized inside the 
coding sequences of fimctional genes. The exon 4 of the Daxx gene included a 
20 microsatellite repeat M2_3_3, consisting of (GAG)5, which encodes polyglutamic acids. 
Another microsatellite M2_3_4, (GAG)2GAA(GAG)3, localized in the exon 1 sequence 
of the BINGl gene, also encodes polyglutamic acids. The RXRB gene contained 
M2_3_8, (GCG)6, which gives rise to polyalanines, in exon 1 . The first exon of the 
COLl 1A2 gene possessed M2_3_10, (CTC)4, which encodes polyleucines. Among 
25 them, the three microsatellite repeats, M2_3_3, M2__3_4, and M2_3_l 0, did not exhibit 
any repeat polymorphism. 
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Table 1 


I Microsatellites in the HLA class II region 


nucleotide repeat 


total 


>5 repeats >10 repeats 


>20 nucleotides 


di 


158 


51 


51 


tri 


65 


28 


8 


tetra 


163 


54 


54 


penta 


108 


12 


27 


total 


494 


94 51 


140 






145 





According to the criteria that microsatellites with more than 10 repeats for di- 
5 nucleotide repeats and with more than 5 repeats for tri-, tetra-, and penta-nucleotide 
repeats are expected to display a high degree of repeat polymorphism (Weber, W. J, L. 
(1990) Genomics 7, 524-530), 51 di-, 28 tri-, 54 tetra-, and 12 penta-nucleotide repeats 
(in total, 145) were selected among the total 494 microsatellites contained in the class II 
region. These are summarized in Table 1 and include 94 perfect repeats, 46 imperfect 
10 repeats, and five compound repeats (Table 2). The bulk of these microsatellite consisted 
of perfect repeats as compound repeat sequences were relatively rare. 



Table 2 Repeat units of 145 microsatellites in the HLA c 


ass II region 


nucleotide repeat 


perfect 


imperfect 


compound 


total 


di 


34(13) 


13(1) 


4(3) 


51 (17) 


tri 


23(1) 


5(1) 


0(0) 


28(2) 


tetra 


32(6) 


21(4) 


1(1) 


54(11) 


penta 


5(0) 


7(1) 


0(0) 


12(1) 


total 


94 (20) 


46(7) 


5(4) 


145(31) 



15 ( ): the number of polymorphic microsatellites (see text). 

EXAMPLE 3 Isolation of human genomic DNA 

A total of 190 unrelated healthy Japanese blood donor volunteers were enrolled in 
20 the present invention. Genomic DNAs were isolated from lymphoblastoid cell lines or 
peripheral blood leukocytes by phenol extraction after lysis with proteinase K and 0.5% 
sodium dodecyl sulfate (SDS) (Inoko, H. et al. (1986) Hum. Immunol. 16, 304-312). 
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EXAMPLE 4 Detection of microsatellite polymorphism 

Out of those 145 microsatelUte repeats, 41 repeats were randomly chosen and 
investigated as to the degree of repeat polymorphism. To roughly survey the degree of 
repeat polymorphism of these microsatellite repeats, the size of PGR amplified products 
5 was investigated by the fluorescent-based genotyping method using human genomic 
DNAs derived from 8 Japanese using HLA homozygous B-cells lines. 

The fluorescent-based genotyping method is as follows. Fluorescent dye- 
conjugated PGR primers were unilaterally labeled at the 5'-end with the fluorescent 
reagent, 6-FAM, HEX, TET, or NED (PE biosystems Japan Go. and GENSET SA). PGR 

10 amplification of microsatellites was carried out in a 20 ^il PGR reaction containing 2 jxl of 
dNTP (2.5 mM each), genomic DNA (5 pil; 2 ng/^l), 2 ^il of lOxbuffer (100 mM Tris- 
HGl, pH 8.3, 500 mM KGl, and 15 mM MgGb), 20 pmol of forward and reverse primers, 
and 0.5 U TaKaRa recombinant Taq polymerase (Takara Shuzo Go.) in an automated 
thermal cycler (PE biosystems Japan Go.). PGR reaction conditions were as follows: 

1 5 after initial denaturation for 5 min at 96°G, annealing for 1 min at 56^G, and extension 
for 1 min at 56°G, amplifications were processed through 30 temperature cycles 
consisting of 45-sec denaturation at 96°G, 45-sec annealing at 56°G, and 1-min extension 
at 72°G with a final extension of 7 min for 72°G. Each PGR product was denatured for 5 
min at 96°G, pooled, mixed with formamide-containing loading buffer, and then 

20 separated on 4% polyacrylamide denaturing gels containing 8 M urea with a size standard 
marker of GS500 TAMURA (PE biosystems Japan Go.) using an ABI 377 automated 
sequencer XL. 

Thirty-one of the above-mentioned 41 microsatellite repeats (76%) were predicted 
to be quite polymorphic in the Japanese population by a rough survey using 8 Japanese 
25 HLA homozygous B-cells lines (Table 2). 

EXAMPLE 5 Estimation of allele fi:equencies of microsatellite repeats 

To examine allele fi'equencies of these 31 polymorphic microsatellite repeats by 
direct counting, 21 of them were subjected to fluorescent-based genotyping using 
30 genomic DNAs fi-om 190 normal Japanese healthy blood donor volunteers. The PGR 
reaction was carried out in a 96-well plate and PGR products were run with a size 
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standard marker of GS500 ROX (PE biosystems Japan Co) using an ABI 3700 automated 
sequencer. Other conditions were the same as described in EXAMPLE 4. 

Observed hetero2ygosity, expected heterozygosity, and the polymorphism 
information content (PIC) value, which are contingent on the number of alleles in 
5 samples and the sample size, were calculated from the observed frequencies in the 
population. Observed heterozygosity was calculated with: 

Ht(Obs) = HnAVn, 

1 0 Where Hn is the nxmiber of individuals that show heterozygosity at a given microsatellite 
locus, and Wn is the total niimber of individuals whose alleles at a given locus were 
examined, expected heterozygosity (Ht) was calculated with: 
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Ht=l -Spi^ 



and PIC was calculated with: 



PIC = Ht-SS2piV 



20 where pi and pj are the observed frequencies of i th and j th alleles at a given 
microsatellite locus. 

Information on these 21 markers, including localization, repeat unit, allele 
number, and size range as well as heterozygosity values, PIC, and amplification PCR 
primers, are listed in Table 3. Heterozygosity was in the range of 0.03 to 0,94 with an 

25 average of 0.58. The number of alleles ranged from 2 to 28 v^th an average of 11 .38. 
The PIC value was between 0.03 and 0.94 with 0.57 on average. These 21 new 
polymorphic microsatellite markers are almost uniformly interspersed, approximately 
every 49 kb on average within the HLA class n region (Figure 1). 
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Table 3 Characteristics of microsatellite markers 



PGR primers 


TTCC: GGGATTGATTCCAAAACCC 
6GAA: GAGATCAAGACCATCCTGGC 


TC: TGTTTGCCAGGAACTGTGC 
GA: ACTATGCAGCATCCAAGGC 


TA: TTGCAAATACGATGTCGAAGG 
AT: AAACCTCCTAACCTCTGTGACC 


TCT: GAGGTCAGCTTCCTCAAATAGG 
AGA: CCCACACCTGTAATCTTAGTGC 


TA: CCACTCCCATCTTATAGTTGTGTC 
AT: AATTCCATTCGCCCAGAG 


AT: CCAATGTTTGATAGCAGACTGG 
TA: CCTAGAGATTCCTCCGTATTAGTTC 


GT: GGA6ACACATTCAAACCATAGC 
AC: CAATTGGTGACATACATCAACTTG 






CO ^ 


VO CO 


OO 


OS S 




CO 


^ — ' O-i 

SIS as 


0.58 
0.45 
0.56 


0.57 
0.58 
0.51 


0.79 
0.80 
0.77 


vo 
<N1 t^- 

GOO 


0.89 
0.35 
0.88 


0.79 
0.43 
0.77 


0.81 
0.78 
0.78 


No. of 
alleles'' 
Range (bp) 


369-475 


359-371 


17 

455-499 


14 

423-454 


15 
393-435 


15 
292-336 


10 
197-219 


Repeat unif^ 
Original allele 
Clone 


(TTCC)3(TTCCC)2(nCC)8 
allele 465 
dJ1033B10 


(TC)13 
allele 363 
dJ1033B10 


(TA)14 
allele 465 
dJ1033B10 


(TCT)4T3(TCT)4 
allele 429 
dJ1033B10 


(A)33(TA)15 
allele 408 
019A 


{AT)25 
allele 339 ' 
014 


(6T)14 
allele 205 
014 


Localization'' 


Tel. (20kb)/ AREl- 
hom 

Cen. (18kb)/ RINGl 


IN: RING2 


Tel. (15kb)/C0LIA2 
Cen. (63kb)/ DPBl 


Tel. (21kb)/C0LIA2 
Cen. (57kb)/ DPBl 


Tel. (41kb)/ DPAl 
Cen. (18kb)/ DNA 


Tel. (7kb)/ DNA 
Cen. (18kb)/ RING3 


Tel. (llkb)/ DNA 
Cen. (14kb)/ RING3 


Micro- 
satellite' 


M2_4_9 


M2_2_9 


oo 

1 

2 


M2_3_ll 


I 

csa 


oo 





continued 
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Table 3 Characteristics of microsatellite markers (continued) 



PGR primers 


CA: TTGCATACACTCT6AAGCAGC 
16: TCCCTGTGGATGTCAAGAATC 


6T: GAATGGATGCTGCATGAGG 
AC: AAGTGTTGAAG6AACTCCCTGC 


TATC: TCACTCATGGTTGCTTTTCC 
GATA: 6AATGATAGGAGTCCATTGTGG 


AATA: TTGTGGTTTCAGCTACTCAGG 
lATT: TTCTnCATTTGGCCTCTACTG 


TC: TACCTTATCATTACCGGAATGC 
6A: CGCTGGACCAGAAAGTTAGG 


AT: GGCAGCAGAATGAGACTCTG 
AT: ACGTCCCATGAGGACAGG 


GAAG: GTTCTGGAGATCTGTGGTGG 
CTTC: GGACTCCAGTTTCAATGCC 


go 


to CO 

1— 1 f-H 


OO 


03 CZ5 


CV3 CM 


CO ^ 
CM CM 


to CO 
CM CM 


t— OO 
CM CM 


HT (Exp) 


0.76 
0.73 
0.72 


0.26 
0.28 
0.24 


0.69 
0.70 
0.64 


0.24 
0.25 
0.22 


0.03 
0.03 
0.03 


0.50 
0.47 
0.47 


0.47 
0.83 
0.78 


No. of 

alleles'* 
Range (bp) 


289-297 


433-437 


189-213 


183-200. 


370-380 


154-164 


318-396 


Repeat unit' 
Original allele 
Clone 


(CA)13 
allele 293 
027 


(6T)11 
allele 437 
HA14-I1I802 


(TATC)12 
allele 201 
HA14-III802 


(AATA)6 
allele 183 
HA14-III802 


(TC)4TT(TC)6 
allele 380 
HA14-III802 


(AT)6AC(AT)5(GT)5 
allele 160 ' 
DV19F1121 


(GAAG)IO 
allele 380 

Fim 


Localization'' 


IN: RING3 


Tel. (2kb)/ DMB 
Cen. (73kb)/ LMP2 


Tel. (49kb)/ DMB 
Cen. (27kb)/LMP2 


Tel. (50kb)/ DMB 
Cen. (25kb)/LMP2 


Tel. (7kb)/ TAP2 
Cen. (4kb)/ DOB 


Tel. (97kb)/ DOB 
Cen. (50kb)/ DQBl 


Tel. (21kb)/ DQB3 
Cen. (62kb)/ DQAl 


Micro- 
satellite' 


CO 

2 


^1 

1 

ad 


to 

^1 


CO 


M2_2_29 


M2_2_32 


CM 

CO 
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Table 3 Characteristics of microsatellite markers (continued) 



PGR primers 


HTA: TCATTATCCCCAGTTCAATGAC 
TAAA: GGGACAGAGCGAGACTCTG 


STTC: AATGAGGTAATATAGGAAGCAGTGG 
GAAA: TTTGTTCCTGGTCTCGCTC 


TTA: TGCACATAGAGAGCTCCAATC 
ATT: AGGCA6GAGGTTTGCTTG 


GT: ACTGCAGACACAACTACGGG 
AC: TCCTTGCTCAGGATAGAGAGG 


TTCTT: CCAGATTTCCTAGATTACCATCATC 
AAGAA: TGAAATTGCAACCAGAATATCAC 


TC: AGATGGATTCACCTATTGTTCG 
6A: TCATCACHGCCAACCTCC 


TC: ATCCCTAACCCTCACGCC 
GA: GGTGTGGACAACTTTAGTGGC 


CO 


<NJ CO 


CO CO 


CO 

CO CO 


ITD CO 
CO CO 


OO 
CO CO 


CO 




HT (Exp) 


0.77 
0.52 
0.74 


0.09 
0.05 
0.09 


0.05 
0.01 
0.05 


in \r> 

05 Crj 

O o o 


0.78 
0.57 
0.74 


0.69 
0.67 
0.63 


0.62 
0.54 
0.56 


No. of 
alleles'* 
Range (bp) 


11 
237-279 


406-408 


172-181 


28 

200-258 . 


285-318 


385-394 


221-251 


Repeat unit'' 
Original allele 
Clone 


(TTTA)ll 
allele 267 
p797all 


(TTTG)2TTCG(TTTG)2 
allele 408 
dJ93N13 


(TTA)5 
allele 174 
dJ93N13 


(GT)17 
allele 212 
dJ93N13 


i 

>^ M 

E-« ^ t:-4 

O ^ l>- 

^ ^ ^ 
^ ^ 

CO 


(TC)IO 
allele 392 ' 
dJ1077I5 


(TC)26 
allele 243 
dJ1077I5 


Localization'' 


Tel. (114kb)/ DOB 
Cen. (33kb)/ DQBl 


Tel. (18kb)/ DQAl 
Cen. (26kb)/ DRBl 


Tel. (34kb)/ DQAl 
Cen. (16kb)/ DRBl 


1N:DRB1 


Tel. (49kb)/ DRBl 
Cen. (3kb)/ DRB3 


IN:TSBP 


IN:TSBP 


Micro- 
satellite* 


M2_4J3 


M2_4_37 


M2-3-22 


M2_2_36 


5' 


M2_2_46 


OO 

2 
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^ Naming of microsatellite markers consists of three parts; firstly M2 represents the 
name of temporary consensus genomic sequence, subsequent_25_35_4 and _5 indicate the 
numbers of nucleotides in repetition units, and the last part represents serial numbers. A 
total of 21 markers are listed. 
5 ^ Detailed locations are given according to the genome sequence in this region (The 
MHC sequencing consortium (1999) Nature 401, 921-923). The most adjacent telomeric 
(Tel.) and centromeric (Cen.) gene names, and their distances (kb) from each marker are 
indicated. 

^ Under repeat imits, the sizes of original alleles in the genomic sequences determined 
10 from cosmid, PAC, or BAC clones are given. At the bottom, the names of cosmid, PAC, 
or BAC clones are indicated. 

^ A total number of alleles detected in the present invention is given. Range (bp) 
indicates the size of range of all alleles at each microsatellite locus. 

^ HT(Exp), HT(Obs), and PIC represent expected heterozygosity, observed 
15 heterozygosity, and PIC (polymorphism information content), respectively. 

What is claimed is: 
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1 1 . An oligonucleotide primer, wherein said primer is capable of specifically 

2 hybridizing to a DNA having the sequence of the flanking regions of a microsatellite 

3 selected firom the group consisting of M2_4_9, M2_2_9, M2_2_12, M2_3_l 1, M2_2_20, 

4 M2_2_21 , M2_2_22, M2_2_23, M2_2_24, M2_4_25, M2_4_26, M2_2_29, M2_2_32, 

5 M2_4_32, M2_4_33, M2_4_37, M2_3_22, M2_2_36, M2_5_l 1 , M2_2_46, and 

6 M2_2_48. 

1 2. The oligonucleotide primer according to claun 1 , wherein the sequence of said 

2 primer is selected from the group consisting of SEQ ID NOs: 1-42. 

1 3 . A kit for determining the number of repeat units of a microsatellite selected from 

2 the group consisting of M2_4_9, M2_2_9, M2_2_l 2, M2_3_l 1 , M2_2_20, M2_2_21 , 

3 M2_2_22, M2_2_23, M2_2_24, M2_4_25, M2_4_26, M2_2_29, M2_2_32, M2_4_32, 

4 M2_4_33, M2_4_37, M2_3_22, M2_2_36, M2_5_l 1 , M2_2_46, and M2_2_48, the kit 

5 comprising a pair of oligonucleotide primers having the sequence of the flanking regions 

6 of said microsatellite. 

1 4. Thekitaccordingtoclaim3, wherein the pair of oligonucleotide primers is 

2 selected from the group consisting of 

3 (a) SEQ ID NO: 1 and SEQ ID NO: 2, 

4 (b) SEQ ID NO: 3 and SEQ ID NO: 4, 

5 (c) SEQ ID NO: 5 and SEQ ID NO: 6, 

6 (d) SEQ ID NO: 7 and SEQ ID NO: 8, 

7 (e) SEQ ID NO: 9 and SEQ ID NO: 1 0, 

8 (f) SEQ ID NO: 1 1 and SEQ ID NO: 12, 

9 (g) SEQ ID NO: 1 3 and SEQ ID NO: 14, 

1 0 (h) SEQ ID NO: 1 5 and SEQ ID NO: 1 6, 

1 1 (i) SEQ ID NO: 17 and SEQ ID NO: 1 8, 

12 G) SEQ ID NO: 19 and SEQ ID NO: 20, 

13 (k) SEQ ID NO: 21 and SEQ ID NO: 22, 

1 4 (1) SEQ ID NO: 23 and SEQ ID NO: 24, 

1 5 (m) SEQ ID NO: 25 and SEQ ED NO: 26, 
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16 (n)SEQIDNO:27andSEQIDNO:28, 

17 (o) SEQ ID NO: 29 and SEQ ID NO: 30, 

1 8 (p) SEQ ID NO: 3 1 and SEQ ID NO: 32, 

1 9 (q) SEQ ID NO: 33 and SEQ ID NO: 34, 

20 (r) SEQ ID NO: 35 and SEQ ID NO: 36, 

21 (s) SEQ ID NO: 37 and SEQ ID NO: 38, 

22 (t) SEQ ID NO: 39 and SEQ ID NO: 40, and 

23 (u) SEQ ID NO: 41 and SEQ ID NO: 42. 

1 5 . A method for determining the number of repeat units of a microsatellite, the 

2 method comprising a step for determining the number of repeat imits in the region of 

3 which DNA can be amplified by losing a pair of oligonucleotide primers selected from the 

4 group consisting of, 

5 (a) SEQ ID NO: 1 and SEQ ID NO: 2, 

6 (b) SEQ ID NO: 3 and SEQ ID NO: 4, 

7 (c) SEQ ID NO: 5 and SEQ ID NO: 6, 

8 (d) SEQ ID NO: 7 and SEQ ID NO: 8, 

9 (e) SEQ ID NO: 9 and SEQ ID NO: 1 0, 

1 0 (f) SEQ ID NO: 1 1 and SEQ ID NO: 12, 

1 1 (g) SEQ ID NO: 13 and SEQ ID NO: 14, 

12 (h) SEQ ID NO: 15 and SEQ ID NO: 16, 

1 3 (i) SEQ ID NO: 1 7 and SEQ ID NO: 1 8, 

14 (j) SEQ ID NO: 19 and SEQ ID NO: 20, 

1 5 (k) SEQ ID NO: 21 and SEQ ID NO: 22, 

1 6 (1) SEQ ID NO: 23 and SEQ ID NO: 24, 

1 7 (m) SEQ ID NO: 25 and SEQ ID NO: 26, 

1 8 (n) SEQ ID NO: 27 and SEQ ID NO: 28, 

19 (o) SEQ ID NO: 29 and SEQ ID NO: 30, 

20 (p) SEQ ID NO: 3 1 and SEQ ID NO: 32, 

21 (q) SEQ ID NO: 33 and SEQ ID NO: 34, 

22 (r) SEQ ID NO: 35 and SEQ ID NO: 36, 

23 (s) SEQ ID NO: 37 and SEQ ID NO: 38, 



23 



24 (t) SEQ ID NO: 39 and SEQ ID NO: 40, and 

25 (u) SEQ ID NO: 41 and SEQ ID NO: 42. 

1 6. A method for mapping of susceptibility genes for disease associated with HLA 

2 class II alleles, by using a microsatellite marker selected from the group consisting of 

3 M2_4_9, M2_2_9, M2_2_12, M2_3_l 1, M2_2_20, M2_2_21, M2_2_22, M2_2_23, 

4 M2_2__24, M2_4_25, M2_4_26, M2_2_29, M2_2_32, M2_4_32, M2_4_33, M2_4_37, 

5 M2_3_22, M2_2_36, M2_5_l 1 , M2_2_46, and M2_2_48, the method comprising: 

6 (a) determining the number of repeat units of said microsatellite, 

7 (b) estimating the allele frequencies of patients and controls, based on said number, and 

8 (c) comparing the allele frequencies of patients with those of controls. 

1 7, The method according to claim 6, the method comprising: 

2 (a) amplifying a region of microsatellite using the ohgonucleotide primer capable of 

3 selectively hybridizing to a DNA having a sequence of flanking regions of said 

4 microsatellite, 

5 (b) determining the number of repeat units of said microsatellite, 

6 (c) estimating the allele frequencies of patients and controls, based on the number, and 

7 (d) comparing the allele frequencies of patients with those of controls. 

1 8. A method for genotyping of a microsatellite allele selected from the group 

2 consisting of M2_4_9, M2_2_9, M2_2_12, M2_3_ll, M2_2_20, M2__2_21, M2_2_22, 

3 M2_2_23, M2_2_24, M2_4_25, M2_4_26, M2_2__29, M2_2_32, M2_4J2, M2_4_33, 

4 M2_4_37, M2__3_22, M2_2_36, M2_5_l 1 , M2_2_46, and M2_2_48, the method 

5 comprising: 

6 (a) amplifying a region of the microsatellite, and 

7 (b) determining the number of repeat units of said microsatellite. 

1 9. The method according to claim 7, wherein said amplifying is performed by using 

2 the oligonucleotide primer selected from the group consisting of SEQ ID NOs: 1-42. 
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ABSTRACT 

Novel polymorphic microsatellite markers in the human MHC class II region and 
methods for disease mapping and genotyping with said microsatellite markers are 
provided. Said microsatellite markers are useful in HLA-related research^ such as genetic 
mapping of HLA class II associated diseases, transplantation matching, population 
genetics, and identification of recombination hot spots as v^ell as linkage disequilibrium 
studies. 
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Figure 1 
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SEQUENCE LISTING 

<110> Hidetoshi, Inoko 
Gen, Tamiya 
Yasunari, Matsuzaka 

<120> NOVEL POLYMORPHIC MICROSATELLITE MARKERS IN THE HUMAN MHC CLASS II REGION 



<130> C1-X0011-US 

<160> 42 

<170> PatentIn version 3.0 

<210> 1 

<211> 19 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(23) 

<223> artificially synthesized primer sequence 



<400> 1 

gggattgatt ccaaaaccc 19 



<210> 2 

<211> 20 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(24) 

<223> artificially synthesized primer sequence 



<400> 2 

gagatcaaga ccatcctggc 20 



<210> 3 
<211> 19 
<212> DNA 
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<213> Artificial /Unknown 
<220> 

<221> niisc_feature 

<222> (1)..(21) 

<223> artificially synthesized primer sequence 



<400> 3 

tgtttgccag gaactgtgc 



<210> 4 

<211> 19 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(21) 

<223> artificially synthesized primer sequence 



<400> 4 

actatgcagc atccaaggc 



<210> 5 

<211> 21 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(23) 

<223> artificially synthesized primer sequence 



<400> 5 

ttgcaaatac gatgtcgaag g 



<210> 6 

<211> 22 

<212> DNA 

<213> Artificial /Unknown 



<220> 
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<221> misc_feature 
<222> (1)..(24) 

<223> artificially synthesized primer sequence 



<400> 6 

aaacctccta acctctgtga cc 



<210> 7 

<211> 22 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> inisc_feature 

<222> (1)..(25) 

<223> artificially synthesized primer sequence 



<400> 7 

gaggtcagct tcctcaaata gg 



<210> 8 

<211> 22 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(25) 

<223> artificially synthesized primer sequence 



<400> 8 

cccacacctg taatcttagt gc 



<210> 9 

<211> 24 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..{26) 

<223> artificially synthesized primer sequence 



4/14 



<400> 9 

ccactcccat cttatagttg tgtc 



<210> 10 

<211> 18 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> niisc_feature 

<222> (1)..(20) 

<223> artificially synthesized primer sequence 



<400> 10 

aattccattc gcccagag 



<210> 11 

<211> 22 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> inisc_feature 

<222> (1)..(24) 

<223> artificially synthesized primer sequence 



<400> 1 1 

ccaatgtttg atagcagact gg 



<210> 12 

<211> 25 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(27) 

<223> artificially synthesized primer sequence 



<400> 12 



5/14 



cctagagatt cctccgtatt agttc 



<210> 13 

<211> 22 

<212> DMA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(24) 

<223> artificially synthesized primer sequence 



<400> 13 

ggagacacat tcaaaccata gc 



<210> 14 

<211> 24 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(26) 

<223> artificially synthesized primer sequence 



<400> 14 

caattggtga catacatcaa cttg 



<210> 15 

<211> 21 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(23) 

<223> artificially synthesized primer sequence 



<400> 15 

ttgcatacac tctgaagcag c 
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<210> 16 

<211> 21 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(23) 

<223> artificially synthesized primer sequence 



<400> 16 

tccctgtgga tgtcaagaat c 



<210> 17 

<211> 19 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(21) 

<223> artificially synthesized primer sequence 



<400> 17 

gaatggatgc tgcatgagg 



<210> 18 

<211> 22 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(24) 

<223> artificially synthesized primer sequence 



<400> 18 

aagtgttgaa ggaactccct gc 



<210> 19 
<211> 20 
<212> DNA 



7/14 



<213> Artificial /Unknown 
<220> 

<221> niisc_feature 

<222> (1)..(24) 

<223> artificially synthesized primer sequence 



<400> 19 

tcactcatgg ttgcttttcc 



<210> 20 

<211> 22 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(26) 

<223> artificially synthesized primer sequence 



<4G0> 20 

gaatgatagg agtccattgt gg 



<210> 21 

<211> 21 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(25) 

<223> artificially synthesized primer sequence 



<400> 21 

ttgtggtttc agctactcag g 



<210> 22 

<21t> 22 

<212> DNA 

<213> Artificial /Unknown 



<220> 



8/14 



<221> misc_feature 
<222> (1)..(26) 

<223> artificially synthesized primer sequence 



<400> 22 

ttctttcatt tggcctctac tg 



<210> 23 

<211> 22 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(24) 

<223> artificially synthesized primer sequence 



<400> 23 

taccttatca ttaccggaat gc 



<210> 24 

<211> 20 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(22) 

<223> artificially synthesized primer sequence 



<400> 24 

cgctggacca gaaagttagg 



<210> 25 

<211> 20 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(22) 

<223> artificially synthesized primer sequence 



9/14 



<400> 25 

ggcagcagaa tgagactctg 



<210> 26 

<211> 18 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(20) 

<223> artificially synthesized primer sequence 



<400> 26 

acgtcccatg aggacagg 



<210> 27 

<211> 20 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> niisc_feature 

<222> (1)..(24) 

<223> artificially synthesized primer sequence 



<400> 27 

gttctggaga tctgtggtgg 



<210> 28 

<211> 19 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(23) 

<223> artificially synthesized primer sequence 



<400> 28 
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ggactccagt ttcaatgcc 



<210> 29 

<211> 22 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(26) 

<223> artificially synthesized primer sequence 



<400> 29 

tcattatccc cagttcaatg ac 



<210> 30 

<211> 19 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(23) 

<223> artificially synthesized primer sequence 



<400> 30 

gggacagagc gagactctg 



<210> 31 

<211> 25 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(29) 

<223> artificially synthesized primer sequence 



<400> 31 

aatgaggtaa tataggaagc agtgg 
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<210> 32 

<211> 19 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(23) 

<223> artificially synthesized primer sequence 



<400> 32 

tttgttcctg gtctcgctc 



<210> 33 

<211> 21 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> inisc_feature 

<222> (1)..(24) 

<223> artificially synthesized primer sequence 



<400> 33 

tgcacataga gagctccaat c 



<210> 34 

<211> 18 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(21) 

<223> artificially synthesized primer sequence 



<400> 34 

aggcaggagg tttgcttg 



<210> 35 
<211> 20 
<212> DNA 



12/14 



<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(22) 

<223> artificially synthesized primer sequence 



<400> 35 

actgcagaca caactacggg 



<210> 36 

<211> 21 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> nnisc_feature 

<222> (1)..(23) 

<223> artificially synthesized primer sequence 



<400> 36 

tccttgctca ggatagagag g 



<210> 37 

<211> 25 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(30) 

<223> artificially synthesized primer sequence 



<400> 37 

ccagatttcc tagattacca tcatc 



<210> 38 

<2ir> 23 

<212> DNA 

<213> Artificial /Unknown 



<220> 
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<221> misc_feature 
<222> (1)..(28) 

<223> artificially synthesized primer sequence 



<400> 38 

tgaaattgca accagaatat cac 



<210> 39 

<211> 22 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(24) 

<223> artificially synthesized primer sequence 



<400> 39 

agatggattc acctattgtt eg 



<210> 40 

<211> 19 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(21) 

<223> artificially synthesized primer sequence 



<400> 40 

tcatcacttg ccaacctcc 



<210> 41 

<211> 18 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> misc_feature 

<222> (1)..(20) 

<223> artificially synthesized primer sequence 
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<400> 41 

atccctaacc ctcacgcc 



<210> 42 

<211> 21 

<212> DNA 

<213> Artificial /Unknown 
<220> 

<221> inisc_feature 

<222> (1)..(23) 

<223> artificially synthesized primer sequence 



<400> 42 

ggtgtggaca actttagtgg c 
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